Customized audio content relating to an object of interest

ABSTRACT

A device/system and method for creating customized audio segments related to an object of interest are disclosed. The device and/or system can create an additional level of interaction with the object of interest by creating customized audio segments based on the identity of the object of interest and/or the user's interaction with the object of interest. Thus, the mobile device can create an interactive environment for a user interacting with an otherwise inanimate object.

FIELD OF THE INVENTION

At least one embodiment of the present invention pertains to creating customized audio content, and more particularly, to creating customized audio content related to an object of interest identified in an image stream.

BACKGROUND

Children's toys have progressed through the ages. The earliest toys, for example, were made from materials found in nature, such as rocks, sticks, and clay. As civilization progressed, toys became more complex. For example, thousands of years ago, Egyptian, Grecian, and Roman children played with dolls that had wigs and movable limbs and were made from stone, pottery, wax, wood, or terracotta.

As technology advanced and civilization continued to progress, toys also changed. Whereas ancient toys were made from materials found in nature, like stone, wood, and grass, modern toys are often made from plastic, cloth, and synthetic materials. Ancient toys were often made by the parents and family of the children who used them, or by the children themselves. Modern toys, by contrast, are often mass-produced and sold in stores.

This change in the nature of toys is exemplified by the advancements that have taken place in creating interactive toys. For example, dolls, one of the oldest and most universal of human toys, have become increasingly interactive. The earliest and most primitive dolls were simple wooden carvings or bundles of grass. Egyptian dolls were sometimes jointed so that their limbs could move realistically. By the early 19th century there were dolls that could say “mama”. Today there are computerized dolls that can recognize and identify objects and the voice of their owner, and choose among hundreds of pre-programmed phrases with which to respond.

However, current technology does not provide for customized audio segments to be dynamically created in response to recognizing the identity of an arbitrary toy and/or a user's interaction with the toy or other object of interest.

SUMMARY

The techniques introduced here provide for an interactive environment including customized audio segments related to an object of interest created in response to a user's interaction with the object of interest. The techniques include capturing a stream of images that includes the object of interest and locating the object of interest in an image of the image stream. The techniques further include identifying the object of interest and associating the object of interest with an object profile that includes an audio profile and an image profile. The techniques introduced here further provide for creating customized audio segments that are related to the object of interest.

The techniques introduced here further provide for a user to create an object profile for an object of interest. Creation of the object profile includes associating the object of interest with an audio profile and an image profile. The audio profile can be constructed by selecting from a collection of pre-recorded audio profiles, recording audio clips to create the audio profile, selecting a crowd-sourced audio profile, and/or using a computer-generated audio profile.

In one embodiment, a mobile device, using the techniques introduced here, can create an additional level of interaction with the object of interest by creating customized audio segments based on the identity of the object of interest and/or the user's interaction with the object of interest. Thus, the mobile device can create an interactive environment for a user interacting with an otherwise inanimate object.

Other aspects of the techniques summarized above will be apparent from the accompanying figures and from the detailed description which follows.

BRIEF DESCRIPTION OF THE DRAWINGS

One or more embodiments of the present invention are illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements.

FIG. 1 is a block diagram of an example system for creating customized audio content relating to an object of interest.

FIG. 2 is a block diagram of an example mobile device.

FIG. 3 is a block diagram of an example server system.

FIG. 4 is a flow diagram of a process for creating customized audio content relating to an object of interest.

FIG. 5 is a flow diagram of a process for creating an object profile for an object of interest.

DETAILED DESCRIPTION

References in this specification to “an embodiment”, “one embodiment”, or the like, mean that the particular feature, structure, or characteristic being described is included in at least one embodiment of the present invention. Occurrences of such phrases in this specification do not necessarily all refer to the same embodiment.

FIG. 1 is a block diagram of an example system for creating customized audio content relating to an object of interest. In the example embodiment of FIG. 1, the system includes a mobile device 104 coupled with a server 106 through a network 108. While functions performed according to the techniques introduced here may be described as being performed by either the mobile device 104 or the server 106, it will be apparent that the functions can be performed by the mobile device 104 or the server 106, alone or in combination. The mobile device 104 includes a camera 105 to capture a stream of images that include an object of interest 102.

As described in more detail below, the mobile device 104 can send an image from the image stream captured by the camera 105 to the server 106 for processing. In another embodiment, the mobile device 104 can process the image itself. The server 106 is coupled with a storage device 110 that stores audio clips, image profiles, etc., that the server can use to identify an object of interest 102 and create a customized audio segment related to the object of interest 102. The process for creating the customized audio segment is described in more detail below with reference to FIG. 4.
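The device-to-server exchange can be illustrated with a short, hypothetical sketch. The following assumes an HTTP endpoint on the server 106; the URL, field names, and response format are illustrative assumptions, not part of the disclosure.

```python
# Minimal sketch of the FIG. 1 round trip: the mobile device 104 posts
# one frame from the image stream and receives a customized audio
# segment back. The endpoint and payload shape are hypothetical.
import requests

SERVER_URL = "https://example.com/api/audio-segment"  # hypothetical endpoint

def request_audio_segment(image_bytes: bytes) -> bytes:
    """Send one captured frame; return the audio segment bytes."""
    response = requests.post(
        SERVER_URL,
        files={"frame": ("frame.jpg", image_bytes, "image/jpeg")},
        timeout=10,
    )
    response.raise_for_status()
    return response.content  # e.g., WAV bytes handed to an audio player

if __name__ == "__main__":
    with open("frame.jpg", "rb") as f:
        audio = request_audio_segment(f.read())
    with open("segment.wav", "wb") as out:
        out.write(audio)
```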

FIG. 2 is a block diagram of an example mobile device that can be used to implement some or all of the techniques introduced here. The mobile device described here is an illustration of one type of mobile device in which the techniques can be implemented; other mobile devices may also be used for implementing the techniques. For example, mobile devices may include cell phones, personal digital assistants (PDAs), portable email devices (e.g., a Blackberry® device), portable media players (e.g., an Apple iPod Touch®), tablet or slate computers (e.g., an Apple iPad®), netbook computers, notebook computers, e-readers, or any other similar device. While the description here is directed to a mobile device, one of ordinary skill would recognize that the process could be performed by a larger computing device, for example a desktop computer, a game console (e.g., Microsoft Xbox® or Sony PS3®), or a television/set-top box with sufficient audio and video processing capabilities.

In the example of FIG. 2, the device is a processing system that includes a processor subsystem 202 that can include one or more processors. The device further includes a memory 204, a storage module 210, an input device 212, a display device 214, and a network module 216 (e.g., an antenna system to connect to a wireless network), each interconnected by an interconnect 206 and powered by a power supply 209. In one embodiment, the power supply is an external power supply. The device also includes a speaker 220, a microphone 222, and a camera 105.

The display device 214 is configured to display information for viewing. Information for display can consist of textual, graphical, and/or multimedia information and is presentable in a graphical user interface. In some embodiments, the display 214 includes a touch-sensitive screen that allows for the direct manipulation of displayed information. In some embodiments, the displayed information can be manipulated by the input device 212. The input device 212 is configured to produce a signal based on user input. The input device 212 can be, for example, a keyboard, mouse, trackball, touch-sensitive screen, or any other input device capable of communicating a user selection. The signal may include a user selection that conveys the user input to the processor subsystem 202 via the interconnect 206.

The memory 204 illustratively comprises storage locations that can be addressed by the processor subsystem 202 and the device's other components for storing software program code and data structures associated with the present invention. The processor subsystem 202 and the associated components may, in turn, include processing elements and/or logic circuitry configured to execute the software code and manipulate the data structures. The code 208, portions of which are typically resident in memory 204 and executed by the processor subsystem 202, implements the communication operations according to the techniques introduced here. It will be apparent to those skilled in the art that other processing and memory implementations, including various computer-readable storage media, may be used for storing and executing program instructions pertaining to the techniques introduced here.

FIG. 3 is a block diagram of an example server system. In an illustrative embodiment, the server system 106 includes a processor subsystem 310 that includes one or more processors. The server system 106 further includes memory 320, a network adapter 340, and a storage adapter 350, all interconnected by an interconnect 360.

The memory 320 illustratively comprises storage locations that are addressable by the processor(s) 310 and adapters 340 and 350 for storing software program code and data associated with the techniques introduced here. The processor 310 and adapters 340 and 350 may, in turn, comprise processing elements and/or logic circuitry configured to execute the software code and manipulate the data structures. It will be apparent to those skilled in the art that other processing and memory implementations, including various computer-readable storage media, may be used for storing and executing program instructions pertaining to the techniques introduced here.

The network adapter 340 includes a plurality of ports to couple the server system 106 with one or more other systems over point-to-point links, wide area networks, virtual private networks implemented over a public network (the Internet), or a shared local area network. The network adapter 340 thus can include the mechanical components and electrical circuitry that allow the server system 106 to connect with the mobile device 104 over the network 108. One or more server systems can communicate with other systems, such as the mobile device 104, by exchanging messages, for example, using packets or frames of data according to pre-defined protocols.

The storage adapter 350 cooperates with the operating system to access information on attached storage devices, such as storage device 110. The information may be stored on any type of attached array of writable storage media, such as magnetic disk or tape, optical disk (e.g., CD-ROM or DVD), flash memory, solid-state drive (SSD), electronic random access memory (RAM), micro-electro-mechanical storage, and/or any other similar media adapted to store information, including data and parity information. The storage adapter 350 includes a plurality of ports having input/output (I/O) interface circuitry that couples with the storage device over an I/O interconnect arrangement. In one embodiment, remote storage devices can be used to supplement and/or replace local storage. The system can connect to the remote storage devices using the network adapter 340 or a specialized storage adapter (e.g., to connect to cloud-based storage systems like Amazon S3).

FIG. 4 is a flow diagram of a process for creating customized audio content relating to an object of interest. In one embodiment, the object of interest 102 is a toy that a user is playing with. The mobile device 104, using the techniques introduced here, can create an additional level of interaction with the toy by creating customized audio segments based on the identity of the toy and/or the user's interaction with the toy. Thus, the mobile device 104 can create an interactive environment for a child playing with the otherwise inanimate toy.

The process begins at step 402 with the camera 105 of the mobile device 104 capturing a stream of images that include the object of interest 102. In one embodiment, the mobile device 104 transmits the image stream to the server 106 over the network 108. In another embodiment, the image stream can be processed locally by the mobile device 104.
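As a rough sketch of step 402, a device-side capture loop might look like the following; OpenCV's camera index 0 stands in for camera 105, and the frame count is arbitrary.

```python
# Hypothetical capture loop for step 402, sketched with OpenCV.
import cv2

def capture_frames(max_frames: int = 30):
    """Yield frames from the device camera as BGR numpy arrays."""
    cap = cv2.VideoCapture(0)  # camera index 0 stands in for camera 105
    try:
        for _ in range(max_frames):
            ok, frame = cap.read()
            if not ok:
                break
            yield frame
    finally:
        cap.release()
```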

At step 404, after capturing the image stream, the mobile device 104 locates the object of interest 102 in the image stream. The mobile device 104 can, for example, locate the object of interest 102 by tracking a user's interactions with an object in the image stream and determining, based on those interactions, that the object is an object of interest 102. In other examples, the user can perform a specified motion with the object of interest 102 that the mobile device 104 can detect, such as moving the object in an up-and-down or side-to-side motion. In yet another example, the mobile device 104 can locate the object of interest 102 through a user's interaction with the mobile device 104 (e.g., the user can use the touch-sensitive screen or other input device 212 of the mobile device 104 to select the object of interest 102 from the image stream, or the user can provide the mobile device 104 with a spoken command to initiate an automated process for locating the object of interest 102). Further examples include the mobile device 104 automatically locating the object of interest 102 using computer vision.
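The user-assisted location path of step 404 can be sketched as follows, with OpenCV's ROI selector standing in for the touch-screen selection described above; this is an illustrative assumption, not the disclosed implementation.

```python
# User-assisted location sketch for step 404: the user draws a box
# around the object of interest on one captured frame.
import cv2

def locate_by_user_selection(frame):
    """Return the (x, y, w, h) bounding box the user drew, or None."""
    window = "Select the object of interest"
    box = cv2.selectROI(window, frame, showCrosshair=True)
    cv2.destroyWindow(window)
    x, y, w, h = box
    return None if w == 0 or h == 0 else (x, y, w, h)
```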

Once the mobile device 104 has located the object of interest 102 in the image stream, at step 406, the mobile device 104 can identify the object of interest 102. In another embodiment, the mobile device 104 can send an image from the image stream to the server 106 for the server 106 to identify the object of interest 102. Identifying the object of interest 102 can be performed using any one of a number of known image recognition algorithms. Continuing with the example from above where the object of interest 102 is a toy, the mobile device 104 can identify the toy as a popular television/movie character. In one embodiment, multiple objects of interest in the image stream can be located and identified.
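One plausible realization of step 406 is nearest-neighbor feature matching against stored image profiles; any known recognition algorithm could be substituted. The profile-store shape and the match threshold below are assumptions.

```python
# Identification sketch for step 406 using ORB feature matching.
import cv2

orb = cv2.ORB_create()
matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)

def identify(object_crop, image_profiles):
    """image_profiles: dict mapping object id -> reference ORB descriptors
    (a hypothetical store shape). Returns the best-matching id or None."""
    gray = cv2.cvtColor(object_crop, cv2.COLOR_BGR2GRAY)
    _, desc = orb.detectAndCompute(gray, None)
    if desc is None:
        return None
    best_id, best_score = None, 0
    for object_id, ref_desc in image_profiles.items():
        score = len(matcher.match(desc, ref_desc))
        if score > best_score:
            best_id, best_score = object_id, score
    return best_id if best_score >= 20 else None  # threshold is arbitrary
```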

However, the techniques introduced here are not limited to a database of popular toys based on television/movie characters. In one embodiment, an object profile can be created for any object that a user is interacting with. For example, a user can create an object profile for a drawing that the user has made, a less popular toy that may not be included in a database of popular toys, another human, or any other regular household item (e.g., a blender, a piece of fruit, a mop, etc.). Further, if the mobile device 104 is not able to identify the object of interest 102, the user may be prompted to create an object profile for the object of interest 102. The process for creating and/or updating the object profile of an object of interest is described in more detail below with reference to FIG. 5.

In one embodiment, after the mobile device 104 has identified the object of interest 102, at step 408, the mobile device 104 can optionally track the object of interest 102 and the user's interaction with the object of interest 102. Continuing the example of a child playing with a toy, the mobile device 104, using the microphone 222 along with the camera 105, can capture how the child is playing with the toy and base future processing, for example, the creation of customized audio segments, on the child's interaction with the toy. For example, if the mobile device 104 detects that the child is pretending that the toy is flying, the mobile device 104 can create a customized audio segment that expresses the toy's actions and/or possible emotions. In another embodiment, the customized audio segments can be created based on spoken or typed user input. In one embodiment, the mobile device detects the user's emotion in speaking the input, and the customized audio segment is created based, at least in part, on the user's emotion.
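As a toy illustration of step 408, a heuristic like the following could flag a “flying” interaction from tracked bounding-box centroids; the threshold, and the assumption that positions have already been tracked across frames, are both illustrative.

```python
# Illustrative interaction heuristic for step 408: sustained upward
# motion of the tracked object is read as the child pretending the
# toy is flying. Image y coordinates grow downward.
def looks_like_flying(centroids, min_rise_px: float = 120.0) -> bool:
    """centroids: list of (x, y) object positions, oldest first."""
    if len(centroids) < 2:
        return False
    rise = centroids[0][1] - centroids[-1][1]  # positive if the object moved up
    return rise > min_rise_px
```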

After the object of interest 102 has been identified, and in some examples the user's interaction with the object has been tracked, the mobile device 104, at step 410, creates a customized audio segment related to the object of interest 102. The customized audio segment, in some examples, is created by compiling a plurality of audio clips into the customized audio segment. The audio clips can be, for example, licensed audio clips, user-recorded audio clips, crowd-sourced audio clips, and/or computer-generated audio clips. Thus, the customized audio segments can be dynamically created to correspond to any number of situations based on the identity of an object of interest, a user's current interaction with the object of interest, and/or the history of a user's interactions with the object of interest. In one embodiment, where there are multiple objects of interest present in the image stream, customized audio segments for each of the objects of interest can be created to simulate conversation/interaction between the objects of interest.
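Compilation in step 410 can be as simple as concatenating pre-recorded clips. A minimal sketch with the standard-library wave module follows, assuming all clips share a sample rate, sample width, and channel count.

```python
# Compilation sketch for step 410: concatenate WAV clips into one
# customized audio segment. Clips are assumed to share audio parameters.
import wave

def compile_segment(clip_paths, out_path="segment.wav"):
    """Concatenate the given WAV clips and return the output path."""
    frames, params = [], None
    for path in clip_paths:
        with wave.open(path, "rb") as clip:
            if params is None:
                params = clip.getparams()
            frames.append(clip.readframes(clip.getnframes()))
    with wave.open(out_path, "wb") as out:
        out.setparams(params)
        for chunk in frames:
            out.writeframes(chunk)
    return out_path
```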

In the example of a child playing with a popular toy based on a television/movie character, the licensed audio clips can be in the voice of the character. In one embodiment, the licensed audio content can be paid for by the user at the time a toy is recognized and associated with the licensed audio content. In another embodiment, the price of the licensed content can be included in the overall cost of the software and/or hardware implementing the techniques introduced here. Alternatively, a parent, child, and/or a larger community of users can record their own audio clips that can be stored and later compiled by the mobile device 104 into the customized audio segment. In one embodiment, the mobile device 104 stores in a local memory audio clips of interjections, for example, “uh”, “um”, and the like, that can be played while the mobile device 104 creates the customized audio segments.

At step 412, after the mobile device 104 has created the customized audio segment and/or an interjection has stopped playing, the mobile device 104 can play the customized audio segment. The process of creating and playing customized audio segments based on user interaction with the object of interest can be a continuous process, as shown in FIG. 4. As described above, the process is organized as a sequence of operations in the flowchart. However, it should be understood that in this process, and the other processes described herein, at least some of the operations associated with the process potentially can be reordered, supplemented, or substituted for, while still performing the same overall technique.
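Stitching the earlier sketches together, the continuous loop of FIG. 4 might be organized as follows; select_clips_for and play_wav are hypothetical stand-ins for the clip lookup and playback of steps 410 and 412, and the other names reuse the illustrative functions defined above.

```python
# Illustrative end-to-end loop for FIG. 4, reusing the sketches above.
def select_clips_for(object_id):                 # hypothetical clip lookup
    return [f"clips/{object_id}/hello.wav"]

def play_wav(path):                              # stand-in playback; a real
    print(f"playing {path}")                     # device would use speaker 220

def interaction_loop(image_profiles):
    for frame in capture_frames(max_frames=1000):
        box = locate_by_user_selection(frame)    # or an automated locator
        if box is None:
            continue
        x, y, w, h = box
        object_id = identify(frame[y:y + h, x:x + w], image_profiles)
        if object_id is None:
            continue                             # unknown object: FIG. 5 setup
        play_wav(compile_segment(select_clips_for(object_id)))
```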

As described above, the database of objects of interest is not likely to be all-encompassing, and many objects of interest will not be initially included. Thus, a method for creating an object profile, including an audio profile and an image profile, for an object of interest is now described with reference to FIG. 5. The process begins with step 502, where the camera 105 of the mobile device 104 captures a stream of images. Capturing the stream of images is described in more detail above with reference to FIG. 4. The process continues, at step 504, with the mobile device 104 locating the object of interest 102, similar to the description above.

As described above, once the object of interest 102 has been located, at step 506 the mobile device 104 attempts to identify the object of interest 102 as an object that has an object profile already created. If the mobile device 104 is not able to identify the object of interest 102, at step 508, the mobile device 104 prompts the user to begin a setup process to create an object profile for the object. In one embodiment, the prompts can be audio prompts. For example, if the mobile device 104 detects a child playing with a toy that the mobile device 104 is not able to recognize, the mobile device 104 can pose a question, such as “What are you playing with?”, to the child to initiate the setup process. In another embodiment, the setup prompt can be a popup on the display 214 of the mobile device 104.

In one embodiment, at step 510, the mobile device 104 can prompt the child to hold the toy up so that the mobile device 104 can capture a clear image of the object. The mobile device 104 can use image processing software to build an image profile of the toy so that the toy can be recognized in later interactions. It is understood that at any point during the setup process, if the mobile device 104 is able to recognize the object of interest 102 as an object that already has an object profile created, the setup process can be ended and the mobile device 104 can begin to create customized audio segments related to the identified object of interest 102.

Once the mobile device 104 has created an image profile for the object of interest 102, at step 512, the mobile device 104 can create an audio profile for the object. In one embodiment, creating an audio profile can include prompting the user, for example, the child playing with the toy or a parent of the child, to record audio clips that can be used to create the customized audio segment. In another embodiment, the mobile device 104 can prompt the user to select a pre-recorded set of audio clips to associate with the object. For example, if a child is pretending that a toy is a particular character from a movie or television show, the child, or a parent, can select a licensed audio package that is in the voice of that character. In another embodiment, the mobile device 104 can select a generic audio package that fits the characteristics of the toy, for example, a soft, cuddly voice for a teddy bear. In yet another embodiment, the user or the mobile device 104 can associate the toy with a computer-generated voice that can be used to create the customized audio segments. In one embodiment, the audio profile can be created separately from, and later linked to, an object profile that is created by the user.
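One plausible shape for the object profile assembled in FIG. 5 is sketched below as a small data structure; the field names and the encoding of the image profile are assumptions, not taken from the disclosure.

```python
# Hypothetical object-profile shape for the FIG. 5 setup process.
from dataclasses import dataclass, field

@dataclass
class ObjectProfile:
    object_id: str
    image_profile: bytes                # e.g., serialized feature descriptors
    audio_source: str = "recorded"      # "licensed" | "recorded" | "crowd" | "synthesized"
    audio_clips: list[str] = field(default_factory=list)  # paths to WAV clips

# Example: a profile for an unrecognized teddy bear created at step 512.
profile = ObjectProfile(
    object_id="teddy-bear-1",
    image_profile=b"...",               # built from the image captured in step 510
    audio_source="synthesized",
)
```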

The information that is gathered and/or created during the setup process, for example, the image profile, recorded audio clips, or the like, can be stored by the mobile device 104 in storage 210 for use in creating customized audio segments. In another embodiment, the information can be stored in the storage 110 coupled with the server 106 and accessed by the mobile device 104 over the network 108.

If, at step 506, the mobile device 104 can identify the object of interest 102, the process continues, at step 514, with creating customized audio segments as described above with reference to FIG. 4. However, at any time while the customized audio segments are being created, the user can initiate the setup process. At step 516, as long as the user does not initiate the setup process, for example, by a verbal command or interaction with the display 214 and/or input device 212, the customized audio content creation process, step 514, continues. However, in response to a user's command to enter setup, at step 516, the mobile device 104 allows the user to create new audio content and/or modify existing audio content related to the object of interest 102.

The techniques and elements introduced above can be implemented by programmable circuitry programmed or configured by software and/or firmware, or they can be implemented entirely by special-purpose “hardwired” circuitry, or in a combination of such forms. Such special-purpose circuitry (if any) can be in the form of, for example, one or more application-specific integrated circuits (ASICs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), etc. Further, the term “processor” encompasses all kinds of apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, a system on a chip, or multiple ones, or combinations, of the foregoing.

Software or firmware for use in implementing the techniques introduced here may be stored on a machine-readable storage medium and may be executed by one or more general-purpose or special-purpose programmable microprocessors. A “machine-readable medium”, as the term is used herein, includes any mechanism that can store information in a form accessible by a machine (a machine may be, for example, a computer, network device, cellular phone, personal digital assistant (PDA), manufacturing tool, any device with one or more processors, etc.). For example, a machine-accessible medium includes recordable/non-recordable media (e.g., read-only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; etc.). Moreover, while a machine-readable storage medium is not a propagated signal, a machine-readable storage medium can be a source or destination of computer program instructions encoded in an artificially-generated propagated signal.

The term “logic”, as used herein, can include, for example, special-purpose hardwired circuitry, software and/or firmware in conjunction with programmable circuitry, or a combination thereof.

Embodiments of the subject matter described in this specification can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network.

The computing system can include any number of clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. In some embodiments, a server transmits data (e.g., an HTML page) to a client device (e.g., for purposes of displaying data to and receiving user input from a user interacting with the client device). Data generated at the client device (e.g., a result of the user interaction) can be received from the client device at the server.

Although the present invention has been described with reference to specific exemplary embodiments, it will be recognized that the invention is not limited to the embodiments described, but can be practiced with modification and alteration within the spirit and scope of the appended claims. Accordingly, the specification and drawings are to be regarded in an illustrative sense rather than a restrictive sense.

1. A method comprising: capturing, by a camera of a mobile device, at least one image including a toy; transmitting the at least one image to a server; receiving, in response to transmitting the at least one image to the server, a customized audio segment relating to the toy; and playing, by the mobile device, the customized audio segment.
 2. The method of claim 1 further comprising: locating the toy in the at least one image; detecting, by the mobile device, an identity of the toy; and transmitting the identity of the toy to the server.
 3. The method of claim 2, wherein the customized audio segment is related to the identity of the toy.
 4. The method of claim 1, wherein a plurality of pre-recorded audio segments are compiled to form the customized audio segment.
 5. The method of claim 2, wherein locating the toy in the at least one image includes receiving, from a user, an indication that an object in the at least one image is the toy.
 6. A method comprising: receiving, at a server from a mobile device, at least one image including an object of interest; creating a customized audio segment relating to the object of interest; and transmitting the customized audio segment to the mobile device.
 7. The method of claim 6 further comprising: locating the object of interest in the at least one image; and detecting an identity of the object of interest.
 8. The method of claim 7, wherein creating a customized audio segment includes compiling stored audio clips into the customized audio segment based on the identity of the object of interest.
 9. The method of claim 7 further comprising: receiving a user indication of the object of interest in the at least one image, wherein locating the object of interest is based on the user indication.
 10. The method of claim 6, wherein the object of interest is a toy.
 11. The method of claim 6, wherein the object of interest is a human.
 12. The method of claim 6, wherein the object of interest is a regular household item.
 13. A method comprising: capturing, by a camera of a mobile device, a video stream including an object of interest; locating the object of interest in the video stream; identifying the object of interest; creating a customized audio segment based on an identity of the object of interest; and playing the customized audio segment.
 14. The method of claim 13 further comprising, playing additional customized audio segments for the object of interest based on further user interactions.
 15. The method of claim 14, wherein the further user interactions include text or voice input.
 16. The method of claim 14, wherein the further user interactions include physical motion of the user, the object of interest, or the mobile device.
 17. The method of claim 14, wherein the further user interactions include emotions expressed by the user that are subsequently detected by the mobile device.
 18. The method of claim 13, wherein the customized audio segment is generated, at least in part, based on a history of user interaction with the object of interest.
 19. The method of claim 13 further comprising: locating a second object of interest in the video stream; identifying the second object of interest; creating a second customized audio segment, such that the customized audio segment and the second customized audio segment simulate a conversation between the object of interest and the second object of interest; and playing the second customized audio segment.
 20. The method of claim 13, wherein the customized audio segment is generated based on a plurality of objects of interest.
 21. The method of claim 13, wherein creating a customized audio segment includes compiling stored audio clips into the customized audio segment based on the identity of the object of interest.
 22. The method of claim 13, wherein the customized audio segment is related to the identity of the object of interest.
 23. The method of claim 13, wherein locating the object of interest in the video stream includes receiving, from a user, an indication that an object in the video stream is the object of interest.
 24. The method of claim 13 further comprising, recording a plurality of audio clips, wherein creating a customized audio segment includes compiling a set of the plurality of audio clips into the customized audio segment.
 25. The method of claim 13 further comprising, playing an interjection audio clip while the customized audio segment is being created.
 26. A system comprising: a processor; and a memory coupled with the processor, the memory storing instructions which, when executed by the processor, cause the system to perform a plurality of operations, including: receiving at least one image including an object of interest; locating the object of interest in the at least one image; identifying the object of interest; and creating a customized audio segment based on the identity of the object of interest.
 27. The system of claim 26 wherein the plurality of operations further includes, playing the customized audio segment.
 28. The system of claim 26 further comprising, a user interface configured to display the at least one image and detect a user input, wherein locating the object of interest in the at least one image includes detecting a user input that indicates an object in the at least one image is the object of interest.
 29. The system of claim 26 further comprising, a storage device configured to store a plurality of audio clips, wherein creating the customized audio segment includes compiling a set of the plurality of audio clips into the customized audio segment.
 30. The system of claim 26 further comprising: a microphone configured to receive a plurality of audio clips; and a storage device to store the plurality of audio clips, wherein creating the customized audio segment includes compiling a set of the plurality of audio clips into the customized audio segment.
 31. A method comprising: receiving at least one image including an object of interest; locating the object of interest in the at least one image; identifying the object of interest; and creating a customized audio segment based on the identity of the object of interest.
 32. The method of claim 31, wherein creating a customized audio segment includes compiling stored audio clips into the customized audio segment based on the identity of the object of interest.
 33. The method of claim 31, wherein the customized audio segment is related to the identity of the object of interest.
 34. The method of claim 31, wherein locating the object of interest in the at least one image includes receiving, from a user, an indication that an object in the at least one image is the object of interest.
 35. The method of claim 31 further comprising, recording a plurality of audio clips, wherein creating a customized audio segment includes compiling a set of the plurality of audio clips into the customized audio segment.
 36. The method of claim 31 further comprising, playing the customized audio segment.
 37. The method of claim 31 further comprising, playing an interjection audio clip while the customized audio segment is being created.
 38. The method of claim 31, wherein the customized audio segment is generated by a mobile device at run time.
 39. The method of claim 31, wherein the customized audio segment is generated by a server at run time and transmitted to a mobile device.